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SEQUENCE LISTING 



NERAL INFORMATION: 



(i) 



APPLICANT: CAPUT, DANIEL 



FERRARA, PASCUAL 
GUILLEMOT, JEAN-CLAUDE 
KAGHAD, MOURAD 
LEGOUX, RICHARD 
LOISON, GERARD 
LARBRE, ELIZABETH 
LUPKER , JOHANNES 
LEPLATOIS, PASCUAL 
SALOME, MARK 



(ii) TITLE OF INVENTION: URATE OXIDASE ACTIVITY PROTEIN, 

RECOMBINANT GENE CODING THEREFOR, EXPRESSION VECTOR, 
MICRO-ORGANISMS AND TRANSFORMED CELLS 

(iii) NUMBER OF SEQUENCES: 35 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Foley & Lardner 

(B) STREET: 1800 Diagonal Road, Suite 500 

(C) CITY: Alexandria 

(D) STATE: Virginia 

(E) COUNTRY: USA 

(F) ZIP: 22313-0299 „. 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/659,408 

(B) FILING DATE: 25-APR-1991 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: BENT, Stephen A. 

(B) REGISTRATION NUMBER: 29,768 

(C) REFERENCE/DOCKET NUMBER: 16781/276 BEDL 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (703)836-9300 

(B) TELEFAX: (703)683-4109 

(C) TELEX: 899149 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 301 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus flavus 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Urate oxidase 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Ser Ala Val Lys Ala Ala Arg Tyr Gly Lya Asp Asn Val Arg Val Tyr 
1 5 10 15 

Lys Val His Lys Asp Glu Lys Thr Gly Val Gin Thr Val Tyr Glu Met 
20 25 30 

Thr Val Cys Val Leu Leu Glu Gly Glu lie Glu Thr Ser Tyr Thr Lys 
35 40 45 

Ala Asp Asn Ser Val lie Val Ala Thr Asp Ser lie Lys Asn Thr lie 
50 55 60 

Tyr lie Thr Ala Lys Gin Asn Pro Val Thr Pro Pro Glu Leu Phe Gly 
65 70 75 80 

Ser lie Leu Gly Thr His Phe lie Glu Lys Tyr Asn His lie His Ala 
85 90 95 

Ala His Val Asn lie Val Cys His Arg Trp Thr Arg Met Asp lie Asp 
100 105 " 110 

Gly Lys Pro His Pro His Ser Phe lie Arg Asp Ser Glu Glu Lys Arg 
115 120 125 

Asn Val Gin Val Asp Val Val Glu Gly Lys Gly lie Asp lie Lys Ser 
130 135 140 

Ser Leu Ser Gly Leu Thr Val Leu Lys Ser Thr Asn Ser Gin Phe Trp 
145 150 155 160 

Gly Phe Leu Arg Asp Glu Tyr Thr Thr Leu Lys Glu Thr Trp Asp Arg 
165 170 175 

lie Leu Ser Thr Asp Val Asp Ala Thr Trp Gin Trp Lys Asn Phe Ser 
180 185 * 190 

Gly Leu Gin Glu Val Arg Ser His Val Pro Lys Phe Asp Ala Thr Trp 
195 200 205 

Ala Thr Ala Arg Glu Val Thr Leu Lys Thr Phe Ala Glu Asp Asn Ser 
210 215 220 

Ala Ser Val Gin Ala Thr Met Tyr Lys Met Ala Glu Gin He Leu Ala 
225 230 235 240 

Arg Gin Gin Leu He Glu Thr Val Glu Tyr Ser Leu Pro Asn Lys His 
245 250 255 

Tyr Phe Glu He Asp Leu Ser Trp His Lys Gly Leu Gin Asn Thr Gly 
260 265 270 

Lys Asn Ala Glu Val Phe Ala Pro Gin Ser Asp Pro Asn Gly Leu lie 
275 280 285 

Lys Cys Thr Val Gly Arg Ser Ser Leu Lys Ser Lys Leu 
290 295 300 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 302 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(iii) HYPOTHETICAL : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus flavus 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Met-Urate oxidase 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Ser Ala Val Lys Ala Ala Arg Tyr Gly Lys Asp Asn Val Arg Val 
1 5 10 15 

Tyr Lys Val His Lys Asp Glu Lys Thr Gly Val Gin Thr Val Tyr Glu 
20 25 30 

Met Thr Val Cys Val Leu Leu Glu Gly Glu lie Glu Thr Ser Tyr Thr 
35 40 45 

Lys Ala Asp Asn Ser Val lie Val Ala Thr Asp Ser lie Lys Asn Thr 
50 55 60 

lie Tyr lie Thr Ala Lys Gin Asn Pro Val Thr Pro Pro Glu Leu Phe 
65 70 75 80 

Gly Ser lie Leu Gly Thr His Phe lie Glu Lys Tyr Asn His lie His 
85 90 * 95 

Ala Ala His Val Asn lie Val Cys His Arg Trp Thr Arg Met Asp lie 
100 105 110 

Asp Gly Lys Pro His Pro His Ser Phe lie Arg Asp Ser Glu Glu Lys 
115 120 125 

Arg Asn Val Gin Val Asp Val Val Glu Gly Lys Gly lie Asp lie Lys 
130 135 140 

Ser Ser Leu Ser Gly Leu Thr Val Leu Lys Ser Thr Asn Ser Gin Phe 
145 150 155 160 

Trp Gly Phe Leu Arg Asp Glu Tyr Thr Thr Leu Lys Glu Thr Trp Asp 
165 170 175 

Arg lie Leu Ser Thr Asp Val Asp Ala Thr Trp Gin Trp Lys Asn Phe 
180 185 190 

Ser Gly Leu Gin Glu Val Arg Ser His Val Pro Lys Phe Asp Ala Thr 
195 200 205 

Trp Ala Thr Ala Arg Glu Val Thr Leu Lys Thr Phe Ala Glu Asp Asn 
210 215 220 

Ser Ala Ser Val Gin Ala Thr Met Tyr Lys Met Ala Glu Gin lie Leu 
225 230 * 235 240 

Ala Arg Gin Gin Leu lie Glu Thr Val Glu Tyr Ser Leu Pro Asn Lys 
245 250 " 255 

His Tyr Phe Glu lie Asp Leu Ser Trp His Lys Gly Leu Gin Asn Thr 
260 265 270 

Gly Lys Asn Ala Glu Val Phe Ala Pro Gin Ser Asp Pro Asn Gly Leu 
275 280 285 

lie Lys Cys Thr Val Gly Arg Ser Ser Leu Lys Ser Lys Leu 
290 295 300 
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(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 906 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Preferred sequence for expression in 
prokaryotes 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



ATGTCTGCGG 


TAAAAGCAGC 


GCGCTACGGC 


AAGGACAATG 


TTCGCGTCTA 


CAAGGTTCAC 


60 


AAGGACGAGA 


AGACCGGTGT 


CCAGACGGTG 


TACGAGATGA 


CCGTCTGTGT 


GCTTCTGGAG 


120 


GGTGAGATTG 


AGACCTCTTA 


CACCAAGGCC 


GACAACAGCG 


TCATTGTCGC 


AACCGACTCC 


180 


ATTAAGAACA 


CCATTTACAT 


CACCGCCAAG 


CAGAACCCCG 


TTACTCCTCC 


CGAGCTGTTC 


240 


GGCTCCATCC 


TGGGCACACA 


CTTCATTGAG 


AAGTACAACC 


ACATCCATGC 


CGCTCACGTC 


300 


AACATTGTCT 


GCCACCGCTG 


GACCCGGATG 


GACATTGACG 


G C AAGCC AC A 


CCCTCACTCC 


360 


TTCATCCGCG 


ACAGCGAGGA 


GAAGCGGAAT 


GTGCAGGTGG 


ACGTGGTCGA 


GGGCAAGGGC 


420 


ATCGATATCA 


AGTCGTCTCT 


GTCCGGCCTG 


ACCGTGCTGA 


AGAGCACCAA 


CTCGCAGTTC 


480 


TGGGGCTTCC 


TGCGTGACGA 


GTACACCACA 


CTTAAGGAGA 


CCTGGGACCG 


TATCCTGAGC 


540 


ACCGACGTCG 


ATGCCACTTG 


GCAGTGGAAG 


AATTTCAGTG 


GACTCCAGGA 


GGTCCGCTCG 


600 


CACGTGCCTA 


AGTTCGATGC 


TACCTGGGCC 


ACTGCTCGCG 


AGGTCACTCT 


GAAGACTTTT 


660 


GCTGAAGATA 


ACAGTGCCAG 


CGTGCAGGCC 


ACTATGTACA 


AGATGGCAGA 


GCAAATCCTG 


720 


GCGCGCCAGC 


AGCTGATCGA 


GACTGTCGAG 


TACTCGTTGC 


CTAACAAGCA 


CTATTTCGAA 


780 


ATCGACCTGA 


GCTGGCACAA 


GGGCCTCCAA 


AACACCGGCA 


AGAACGCCGA 


GGTCTTCGCT 


840 


CCTCAGTCGG 


ACCCCAACGG 


TCTGATCAAG 


TGTACCGTCG 


GCCGGTCCTC 


TCTGAAGTCT 


900 


AAATTG 
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(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 906 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Preferred sequence for expression in 
eukaryotes 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 





1 XAALrGCTGC 


m Ti n rn 7k m 

TAGATACGGT 


7k 7k ^7 ^7 7k ^7 m « ^mjm 

AAGGACAACG 


TTAGAGTCTA 


CAAGGTTCAC 


60 




AGACCGGTGT 


CCAGACGGTG 


TACGAGATGA 


CCGTCTGTGT 


GCTTCTGGAG 


120 


GG1.GAGA1 lb 


AGACCTCTTA 


^7 7k ^1^7 7k Ik ^7 #7 **** 

CACCAAGGCC 


GACAACAGCG 


TCATTGTCGC 


AACCGACTCC 


180 


ATTAAGAACA 


CCATTTACAT 


CACCGCCAAG 


CAGAACCCCG 


TTACTCCTCC 


CGAGCTGTTC 


240 


GGCTCCATCC 


TGGGCACACA 


CTTCATTGAG 


AAGTACAACC 


ACATCCATGC 


CGCTCACGTC 


300 


AACATTGTCT 


GCCACCGCTG 


GACCCGGATG 


GACATTGACG 


GCAAGCCACA 


CCCTCACTCC 


360 


TTCATCCGCG 


ACAGCGAGGA 


GAAGCGGAAT 


GTGCAGGTGG 


ACGTGGTCGA 


GGGCAAGGGC 


420 


TV rp/"^/-* J* T7 TV T»^7TV 

ATCGATATCA 


AGTCGTCTCT 


GTCCGGCCTG 


ACCGTGCTGA 


AGAGCACCAA 


CTCGCAGTTC 


480 


TGGGGCTTCC 


nisi f\ ^7^7 

TGCGTGACGA 


GTACACCACA 


CTTAAGGAGA 


CCTGGGACCG 


TATCCTGAGC 


540 


ACC6ACGTCG 


ATGCCACTTG 


GCAGTGGAAG 


AATTTCAGTG 


GACTCCAGGA 


GGTCCGCTCG 


600 


CACGTGCCTA 


AGTTCGATGC 


TACCTGGGCC 


ACTGCTCGCG 


AGGTCACTCT 


GAAGACTTTT 


660 


GCTGAAGATA 


ACAGTGCCAG 


CGTGCAGGCC 


ACTATGTACA 


AGATGGCAGA 


GCAAATCCTG 


720 


GCGCGCCAGC 


AGCTGATCGA 


GACTGTCGAG 


TACTCGTTGC 


CTAACAAGCA 


CTATTTCGAA 


780 


ATCGACCTGA 


G CTGGC AC AA 


GGGCCTCCAA 


AACACCGGCA 


AGAACGCCGA 


GGTCTTCGCT 


840 


CCTCAGTCGG 


ACCCCAACGG 


TCTGATCAAG 


TGTACCGTCG 


GCCGGTCCTC 


TCTGAAGTCT 


900 


AAATTG 
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(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Preferred non-translated 5' sequence for 
animal cells 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
AGCTTGCCGC CACT 14 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 906 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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(vii) IMMEDIATE SOURCE: 

(B) CLONE: Preferred sequence for expression in animal 
cells 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



ATGTCCGCAG 


TAAAAGCAGC 


CCGCTACGGC 


AAGGACAATG 


TCCGCGTCTA 


CAAGGTTCAC 


60 


AA6GACGAGA 


AGACCGGTGT 


CCAGACGGTG 


TACGAGATGA 


CCGTCTGTGT 


GCTTCTGGAG 


120 


GGTGAGATTG 


AGACCTCTTA 


CACCAAGGCC 


GACAACAGCG 


TCATTGTCGC 


AACCGACTCC 


180 


ATTAAGAACA 


CCATTTACAT 


CACCGCCAAG 


CAGAACCCCG 


TTACTCCTCC 


CGAGCTGTTC 


240 


GGCTCCATCC 


TGGGCACACA 


CTTCATTGAG 


AAGTACAACC 


ACATCCATGC 


CGCTCACGTC 


300 


AACATTGTCT 


GCCACCGCTG 


GACCCGGATG 


GACATTGACG 


GCAAGCCACA 


CCCTCACTCC 


360 


TTCATCCGCG 


ACAGCGAGGA 


GAAGCGGAAT 


GTGCAGGTGG 


ACGTGGT CG A 


GGGCAAGGGC 


420 


ATCGATATCA 


AGTCGTCTCT 


GTCCGGCCTG 


ACCGTGCTGA 


AGAGCACCAA 


CTCGCAGTTC 


480 


TGGGGCTTCC 


TGCGTGACGA 


GTACACCACA 


CTTAAGGAGA 


CCTGGGACCG 


TATCCTGAGC 


540 


ACCGACGTCG 


ATGCCACTTG 


GCAGTGGAAG 


AATTTCAGTG 


GACTCCAGGA 


GGTCCGCTCG 


600 


CACGTGCCTA 


AGTTCGATGC 


TACCTGGGCC 


ACTGCTCGCG 


AGGTCACTCT 


GAAGACTTTT 


660 


GCTGAAGATA 


ACAGTGCCAG 


CGTGCAGGCC 


ACTATGTACA 


AGATGGCAGA 


GCAAATCCTG 


720 


GCGCGCCAGC 


AGCTGATCGA 


GACTGTCGAG 


TACTCGTTGC 


CTAACAAGCA 


CTATTTCGAA 


780 


ATCGACCTGA 


GCTGGCACAA 


GGGCCTCCAA 


AACACCGGCA 


AGAACGCCGA 


GGTCTTCGCT 


840 


CCTCAGTCGG 


ACCCCAACGG 


TCTGATCAAG 


TGTACCGTCG 


GCCGGTCCTC 


TCTGAAGTCT 


900 


AAATTG 
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(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: reverse transcription primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GATCCGGGCC CTTTTTTTTT TTT 23 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Hydrolysis product T 17 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Asn Val Gin Val Asp Val Val Glu Gly Lys 
15 10 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Hydrolysis product T 20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

Asn Phe Ser Gly Leu Gin Glu Val 
1 5 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Hydrolysis product T 23 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Phe Asp Ala Thr Trp Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Hydrolysis product T 27 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

His Tyr Phe Glu lie Asp Leu Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Hydrolysis product T 28 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

lie Leu Ser Thr Asp Val Asp Ala Thr Trp Gin Trp Lys 
1 5 * 10 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Hydrolysis product T 29 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

His Tyr Phe Glu lie Asp Leu Ser Trp His Lys 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
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(vii) IMMEDIATE SOURCE: 

(B) CLONE: Hydrolysis product T 31 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Ser Thr Asn Ser Gin Phe Trp Gly Phe Leu Arg 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Hydrolysis product T 32 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Gin Asn Pro Val Thr Pro Pro Glu Leu Phe Gly Ser lie Leu Gly Thr 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Hydrolysis product T 33 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Gin Asn Pro Val Thr Pro Pro Glu Leu Phe Gly Ser lie Leu Gly Thr 
1 5 10 15 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
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(vii) IMMEDIATE SOURCE: 

(B) CLONE: Hydrolysis product V 1 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Tyr Ser Leu Pro Asn Lys Ala Asp Asn Ser Val lie Val Asp Thr Asp 
15 10 15 

Ser lie Lys Asn Thr lie Tyr lie Thr 
20 25 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 16 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Hydrolysis product V 2 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Val Thr Leu Lys Thr Phe Ala Glu Asp Asn Ser Ala Ser Val Gin Ala 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Hydrolysis product V 3 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Thr Ser Tyr Thr Lys Ala Asp Asn Ser Val lie Val Asp Thr Asp Ser 
1 5 10 15 

lie Lys Asn Thr lie Tyr lie Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Hydrolysis product V 5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Gly Lys Gly lie Asp lie Lys Ser Ser Leu Ser Gly Leu Thr Val Leu 
15 10 15 

Lys Ser Thr Asn Ser Gin Phe Trp Gly Phe Leu Arg 
20 25 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Hydolysis product V 6 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Gly Lys Gly lie Asp lie Lys Ser Ser Leu Ser Gly Leu Thr Val Leu 
15 10 ** 15 

Lys 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1236 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Fragment 3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

GATCCGCGGA AGCATAAAGT GTAAAGCCTG GGGTGCCTAA TGAGTGAGCT AACTTACATT 60 

AATTGCGTTG CGCTCACTGC CCGCTTTCCA GTCGGGAAAC CTGTCGTGCC AGCTGCATTA 120 

ATGAATCGGC CAACGCGCGG GGAGAGGCGG TTTGCGTATT GGGCGCCAGG GTGGTTTTTC 180 

TTTTCACCAG TGAGACGGGC AACAGCTGAT TGCCCTTCAC CGCCTGGCCC TGAGAGAGTT 240 
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GCAGCAAGCG 


GTCCACGCTG 


GTTTGCCCCA 


CCACCCGAAA 


ATCCTGTTTG 


ATGGTGGTTA 


300 


ACGGCGGGAT 


ATAACATGAG 


CTGTCTTCGG 


TATCGTCGTA 


TCCCACTACC 


GAGATATCCG 


360 


CACCAACGCG 


CAGCCCGGAC 


TCGGTAATGG 


CGCGCATTGC 


GCCCAGCGCC 


ATCTGATCGT 


420 


TGGCAACCAG 


CATCGCAGTG 


GGAACGATGC 


CCTCATTCAG 


CATTTGCATG 


GTTTGTTGAA 


480 


AACCGGACAT 


GGCACTCCAG 


TCGCCTTCCC 


GTTCCGCTAT 


CGGCTGAATT 


TGATTGCGAG 


540 


TGAGATATTT 


ATGCCAGCCA 


GCCAGACGCA 


GACGCGCCGA 


GACAGAACTT 


AATGGGCCCG 


600 


CTAACAGCGC 


GATTTGCTGG 


TGACCCAATG 


CGACCAGATG 


CTCCACGCCC 


AGTCGCGTAC 


660 


CGTCTTCATG 


GGAGAAAATA 


ATACTGTTGA 


TGGGTGTCTG 


GTCAGAGACA 


TCAAGAAATA 


720 


ACGCCGGAAC 


ATTAGTGCAG 


GCAGCTTCCA 


CAGCAATGGC 


ATCCTGGTCA 


TCCAGCGGAT 


780 


AGTTAATGAT 


CAGCCCACTG 


ACGCGTTGCG 


CGAGAAGATT 


GTGCACCGCC 


GCTTTACAGG 


840 


CTTCGACGCC 


GCTTCGTTCT 


ACCATCGACA 


CCACCACGCT 


GGCACCCAGT 


TGATCGGCGC 


900 


GAGATTTAAT 


CGCCGCGACA 


ATTTGCGACG 


GCGCGTGCAG 


GGCCAGACTG 


GAGGTGGCAA 


960 


CGCCAATCAG 


CAACGACTGT 


TTGCCCGCCA 


GTTGTTGTGC 


CACGCGGTTG 


GGAATGTAAT 


1020 


TCAGCTCCGC 


CATCGCCGCT 


TCCACTTTTT 


CCCGCGTTTT 


CGCAGAAACG 


TGGCTGGCCT 


1080 


GGTTCACCAC 


GCGGGAAACG 


GTCTGATAAC 


AGACACCGGC 


ATACTCTGCG 


ACATCGTATA 


1140 


ACGTTACTGG 


TTTCACATTC 


ACCACCCTGA 


ATTGACTCTC 


TTCCGGGCGC 


TATCATGCCA 


1200 


TACCGCGAAA 


GGTTTTGCGC 


CATTCGATGG 


TGTCCG 






1236 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 326 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Fragment 4 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

TCGAGCTGAC TGACCTGTTG CTTATATTAC ATCGATAGCT TAGCGTATAA TGTGTGGAAT 60 

TGTGAGCGAT AACAATTTCA CACAGTTTAA CTTTAAGAAG GAGATATACA TATGGCTACC 120 

GGATCCCGGA CTAGTCTGCT CCTGGCT TTT GGCCTGCTCT GCCTGCCCTG GCTTCAAGAG 180 

GGCAGTGCCT TCCCAACCAT TCCCTTATCT AGACTTTTTG ACAACGCTAT GCTCCGCGCC 240 

CATCGTCTGC ACCAGCTGGC CTTTGACACC TACCAGGAGT TTGAAGAAGC CTATATCCCA 300 

AAGGAACAGA AGTATTCATT CCTGCA 326 
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(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Clal-Ndel fragment 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CGATAGCGTA TAATGTGTGG AATTGTGAGC GGATAACAAT TTCACACAGT TTTTCGCGAA 60 
GAAGGAGATA TACA 74 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 163 base pairs 

(B) TYPE: nucleic acid 

(C) '" STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHET I CAL : NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Synthetic hGH gene end fragment 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
GATCTTCAAG CAGACCTACA GCGACGCACT ACTCAAGAAC TACGGGCTGC TCTACTGCTT 60 
CAGGAAGGAC ATGGACAAGG TCGAGACATT CCTGCGCATC GTGCAGTGCC GCTCTGTGGA 120 
GGGCAGCTGT GGCTTCTAGT AAGGTACCCT GCCCTACGTA CCA 163 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Ndel-AccII synthetic fragment 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
TATGTCTGCG GTAAAAGCAG CGCGCTACGG CAAGGACAAT GTTCGCGT 48 
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(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 361 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Plasmid pEMR469 fragment with ADH2 promoter 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 



GGGACGCGTC 


TCCTCTGCCG 


GAACACCGGG 


CATCTCCAAC 


TTATAAGTTG 


GAGAAATAAG 


60 


AGAATTTCAG 


ATTGAGAGAA 


TGAAAAAAAA 


AAAAAAAAAA 


AAGGCAGAGG 


AGAGCATAGA 


120 


AATGGGGTTC 


ACTTTTTGGT 


AAAGCTATAG 


CATGCCTATC 


ACATATAAAT 


AGAGTGCCAG 


180 


TAGCGACTTT 


TTTCACACTC 


GAGATACTCT 


TACTACTGCT 


CTCTTGTTGT 


TTTTATCACT 


240 


TCTTGTTTCT 


TCTTTGGTAA 


ATAGAATATC 


AAGCTACAAA 


AAGCATACAA 


TCAACTATCA 


300 


ACTATTAACT 


ATATCGATAC 


CATATGGATC 


CGTCGACTCT 


AGAGGATCGT 


CGACTCTAGA 


360 


G 












361 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Fragment C 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
CGATATACAC AATGTCTGCT GTTAAGGCTG CTAGATACGG TAAGGACAAC GTTAGAGT 58 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1013 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Fragment D 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

CTACAAGGTT CACAAGGACC ACAAGACCGG TGTCCAGACG GTGTACGAGA TGACCGTCTG 60 

TGTGCTTCTG GAGGGTGAGA TTGAGACCTC TTACACCAAG GCCGACAACA GCGTCATTGT 120 

CGCAACCGAC TCCATTAAGA ACACCATTTA CATCACCGCC AAGCAGAACC CCGTTACTCC 180 

TCCCGAGCTG TTCGGCTCCA TGCTGGGCAC ACACTTTCAT TGAGAAGTAC AACACATCCA 240 

TGCCGCTCAC GTCAACATTG TCTGCCACCG CTGGACCCGG ATGGACATTG ACGGCAAGCC 300 

ACACCCTCAC TCCTTCATCC GCGACAGCGA GGAGAAGCGG AATGTGCAGG TGGACGTGGT 360 

CGAGGGCAAG GGCATCGATA TCAAGTCGTC TCTGTCCGGC CTGACCGTGC TGAAGAGCAC 420 

CAACTCGCAG TTCTGGGGCT TCCTGCGTGA CGAGTACACC ACACTTAAGG AGACCTGGGA 480 

CCGTATCCTG AGCACCGACG TCGATGCCAC TTGGCAGTGG AAGAATTTCA GTGGACTCCA 540 

GGAGGTCCGC TCGCACGTGC CTAAGTTCGA TGCTACCTGG GCCACTGCTC GCGAGGTCAC 600 

TCTGCCGACT TTTGCTGAAG ATAACAGTGC CAGCGTGCAG GCCACTATGT ACAAGATGGC 660 

AGAGCAAATC CTGGCGCGCC AGCAGCTGAT CGAGACTGTC GAGTACTCGT TGCCTAACAA 720 

GCACTATTTC GAAATCGACC TGAGCTGGCA CAAGGGCCTC CAAAACACCG GCAAGAACGC 780 

CGAGGTCTTC GCTCCTCAGT CGGACCCCAA CGGTCTGATC AAGTGTACCG TCGGCCGGTC 840 

CTCTCTGAAG TCTAAATTGT AAACCAACAT GATTCTCACG TTCCGGAGTT TCCAAGGCAA 900 

ACTGTATATA GTCTGGGATA GGGTATAGCA TTCATTCACT TGTTTTTTAC TTCCAAAAAA 960 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAGGGC CCG 1013 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Synthetic GAL 7 fragment 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

CGCGTCTATA CTTCGGAGCA CTGTTGAGCG AAGGCTCATT AGATATATTT TCTGTCATTT 60 

TCCTTAACCC AAAAATAAGG GAGAGGGTCC AAAAAGCGCT CGGACAACTG TTGACCGTGA 120 

TCCGAAGGAC TGGCTATACA GTGTTCACAA AATAGCCAAG CTGAAAATAA TGTGTAGCCT 180 

TTAGCTATGT TCAGTTAGTT TGGCATG 207 
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(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Modified Xbal-Mlul adapter 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
CTAGGCTAGC GGGCCCGCAT GCA 23 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Plasmid pSEl "site binding to Hindlll" 
fragment 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
AGCTGGCTCG CATCTCTCCT TCACGCGCCC GCCGCCCTAC CTGAGGCCGC CATCCACGCC 60 
GGTGAGTCGC GTTCTGCCGC CTCCCGCCTG TGGTGCCTCC TGAACTGCGT CCGCCGTCTA 120 
GGTAGGCTCC AAGGGAGCCG GACAAAGGCC CGGTCT CG AC CTGAGCTCTA AACTTACCTA 180 
GACTCAGCCG GCTCTCCACG CTTTGC CTG A CCCTGCTTGC TCAACTCTAC GTCTTTGTTT 240 
CGTTTTCTGT TCTGCGCCGT TACAACTTCA AGGTATGCGC TGGGACCTGG CAGGCGGCAT 300 
CTGGGACCCC TAGGAAGGGC TTGGGGGTCC TCGTGCCCAA GGCAGGGAAC ATAGTGGTCC 360 
CAGGAAGGGG AGCAGAGGCA TCAGGGTGTC CACTTTGTCT CCGCAGCTCC TGAGCCTGCA 420 
GA 422 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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1 





(vii) IMMEDIATE SOURCE: 

(B) CLONE: Synthetic HindIII-"site binding to BamHI" 
fragment 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 



AGCTTGTCGA CTAATACGAC TCACTATAGG GCGGCCGCGG GCCCCTGCAG GAATTCGGAT 



60 



CCCCCGGGTG ACTGACT 



77 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Synthetic Hindlll-AccI fragment 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
AGCTTGCCGC CACTATGTCC GCAGTAAAAG CAGCCCGCTA CGGCAAGGAC AATGTCCGCG 60 



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 920 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Hindlll-SnaBI fragment 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 



AGCTTGCCGC 


CACTATGTCC 


GCAGTAAAAG 


CAGCCCGCTA 


CGGCAAGGAC 


AATGTCCGCG 


60 


TCTACAAGGT 


TCACAAGGAC 


GAGAAGACCG 


GTGTCCAGAC 


GGTGTACGAG 


ATGACCGTCT 


120 


GTGTGCTTCT 


GGAGGGTGAG 


ATTGAGACCT 


CTTACACCAA 


GGCCGACAAC 


AGCGTCATTG 


180 


TCGCAACCGA 


CTCCATTAAG 


AACACCATTT 


ACATCACCGC 


CAAGCAGAAC 


CCCGTTACTC 


240 


CTCCCGAGCT 


GTTCGGCTCC 


ATCCTGGGCA 


CACACTTCAT 


TGAGAAGTAC 


AACCACATCC 


300 


ATGCCGCTCA 


CGTCAACATT 


GTCTGCCACC 


GCTGGACCCG 


GATGGACATT 


GACGGCAAGC 


360 


CACACCCTCA 


CTCCTTCATC 


CGCGACAGCG 


AGGAGAAGCG 


GAATGTGCAG 


GTGGACGTGG 


420 


TCGAGGGCAA 


GGGCATCGAT 


ATCAAGTCGT 


CTCTGTCCGG 


CCTGACCGTG 


CTGAAGAGCA 


480 



T 



61 
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CCAACTCGCA 


GTTCTGGGGC 


TTCCTGCGTG 


ACGAGTACAC 


CACACTTAAG 


GAGACCTGGG 


540 


ACCGTATCCT 


GAGCACCGAC 


GTCGATGCCA 


CTTGGCAGTG 


GAAGAATTTC 


AGTGGACTCC 


600 


AGGAGGTCCG 


CTCGCACGTG 


CCTAAGTTCG 


ATGCTACCTG 


GGCCACTGCT 


CGCGAGGTCA 


660 


CTCTGAAGAC 


TTTTGCTGAA 


GATAACAGTG 


CCAGCGTGCA 


GGCCACTATG 


TACAAGATGG 


720 


CAGAGCAAAT 


CCTGGCGCGC 


CAGCAGCTGA 


TCGAGACTGT 


CGAGTACTCG 


TTGCCTAACA 


780 


AGCACTATTT 


CGAAATCGAC 


CTGAGCTGGC 


ACAAGGGCCT 


CCAAAACACC 


GGCAAGAACG 


840 


CCGAGGTCTT 


CGCTCCTCAG 


TCGGACCCCA 


ACGGTCTGAT 


CAAGTGTACC 


GTCGGCCGGT 


900 


CCTCTCTGAA 


GTCTAAATTG 










920 
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